A Novel Architecture for Floating-Point Multiply-Add-Fused Operation with Reduced Latency

نویسندگان

  • Haiping Sun
  • Minglun Gao
چکیده

The multiply-add-fused (MAF) operation is very important in many scientific and engineering applications. In this paper, a novel architecture for the MAF operation is presented. According to the fact that some steps of MAF operation are mutually exclusive, the composing steps are organized into two data-paths, and each data-path only contains the indispensable computation steps. One takes effect when the effective subtraction occurs, and the other takes effect when the effective subtraction does not occur. The difference between execution time of the two datapaths is one beat, so the average latency is shortened. Furthermore, the dual adder combined with rounding is invoked by the two data-paths in different beats, therefore it may be shared by the two data-paths and the circuit area may be saved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Paper Title Low Cost and Low Power Floating-point Fused Multiply-Add Unit Design with Proxy Bits and Weighted 2-Level Booth Encoding

With the appearance of high performance mobile devices, low cost and low power consumption have become important issues in high performance processors. To meet the needs, low cost and low power floating-point fused multiply-add unit is proposed in this paper. According to the area and power consumption analysis, the multiplication part in fused multiplyadd operation accounted for most power con...

متن کامل

Floating-Point Single-Precision Fused Multiplier-adder Unit on FPGA

The fused multiply-add operation improves many calculations and therefore is already available in some generalpurpose processors, like the Itanium. The optimization of units dedicated to execute the multiply-add operation is therefore crucial to achieve optimal performance when running the overlying applications. In this paper, we present a single-precision floating-point fused multiply-add opt...

متن کامل

Impact on Performance of Fused Multiply-Add Units in Aggressive VLIW Architectures

Loops are the main time consuming part of programs based on floating point computations. The performance of the loops is limited either by recurrences in the computation or by the resources offered by the architecture. Several general-purpose superscalar microprocessors have been implemented with multiply-add fused floating-point units, that reduces the latency of the combined operation and the...

متن کامل

Design and Analysis of Online Arithmetic Operators for Streaming Data in FPGAs

Online addition and multiplication of streaming data is of prime importance in real time processing of digital signals.The choice of data representation varies from fixedpoint for improved precision and reduced complexity to floatingpoint for increased dynamic range. The bit level granularity of FPGAs permits the choice of standard and nonstandard number representations for hardware accelerator...

متن کامل

Leading-Zero Anticipator (LZA) in the IBM RISC System/6000 Floating-Point Execution Unit

This paper presents a novel technique used in the multiply-add-fused (MAF) unit of the IBM RlSC System/6000* (RS/6000) processor for normalizing the floating-point results. Unlike the conventional procedures applied thus far, the so-called leading-zero anticipator (LZA) of the RS/SOOO carries out processing of the leading zeros and ones in parallel with floating-point addition. Therefore, the n...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003